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FOREWORD 



This publication, entitled I.Q. Tests and Minority Chil- 
dren ^ demonstrates point by point the inadequacies of IQ 
testing for Spanish-speaking children ^and children of other 
^minoTities. 

Designed for use by educators at every level, this edition 
provides a good deal of useful information about tests of in- 
telligence based on translations, ethnic norms, and other ele- 
ments that are not equally familiar to minority group children. 

The authors of this publication not only present the prob- 
lems involved in IQ testing, they also provide suggestions for 
solvir^ these problems; for example by implementing specific 
educational programs with computer-based models, such as the 
Program Assessment Pupil Instruction (PAPI) which uses Piagetian- 
based measures specifically designed for each child in the class- 
room . 

This book and others have been developed by the National 
Multilingual Assessment Program, a specially funded project un- 
der P.L. 89-10, Elementary and Secondary Act, as amended. Title 
VII. . 

The Dissemination Center for Bilingual Bicultural Education 
is indebted to the Multilingual Assessment Program, Joe Ulibarri , 
Director, for allowing us to publish and disseminate the first 
edition of this book. We feel that this work will help to effect 
needed cha^nges in the method by which achievement is measured in 
minority group children, especially Mexican-American students. 

Requests for information concerning this book and other bi- 
lingual materials should be addressed to the Dissemination Center 
for Bilingual Bicultural Education, 6504 Tracor Lane, Austin, ^ 
Texas 78721. 



Juan D. Soils 
Director 



/ 



/ 

ACKNOWLEDGEMENTS 



The authors wish to express their deep 'gratitude to 
Dr* Juan Pascual-Leone of York University, Toronto, 
, Ontario, Canada, for his critical review of several 
drafts and for the use of his Water Level and Figu- 
ral Intersection tests in the test battery. They 
would also like to acknowledge Veronica Hooker •s 
work on the computerized classroom activities. 



/ 



ERIC 



8 



TRADITIONAL TESTS 

Traditional tests of ■ intelligence are inappropriate 
for minority children, particularly children of non- 
English speaking backgrounds. Such diverse groups as 
the- National Education Association, popular press, courts 
civil rights organizations, state and federal agencies 
and school psychologists have all pointed to the failure 
of the test publishing industry to fully consider the 
cultural and linguistic differences of minority children 
when constructing, publishing and selling tests. 

Publishers have responded to this criticism by: 

1) translating existing intelligence tests for 
non-English speaking children; 

2) adjusting norms for ethnic sub-groups J 

3) attempting to construct culture-free tests. 
There are distinct problems with each of these 

approaches. In addition, there are problems concerning 
the basic validity and utility of information produced 
by IQ tests. A discussion of the inadequacy of the 
response of test publishers and a presentation of sev- 
eral other issues follows. 
TRANSLATIONS 

Translating existing intelligence tests for non- 
English speaking children often creates more problems 
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than it solves. Regional differenci^s within a larguage 
make it almost impossible to use a single translation. 
Thus, ^ while the word "toston" refers to f half 
dollar for a Ghicano child, for a Puerto Rican it refers 
to a squashed section of a fried banana. Mono-lingual 
translations are also inappropriate because the language 
familiar to non-English speaking children is often a 
combination of two languages as in the case of "pocho" 

"tex-mex". Furthermore, many non-English speaking ■ 

children have never learned to read in their spoken 
language. One finds many examples of tests written in 
Spanish being given to Chi no children who may speak 
Spanish but have had no prior ins;)truction in reading 
Spanish. ^ 

Another problem in translating tests is that the 
direct translation of a word or phrase in one language 
ra*y result in a word which is not used with the same 
4 frequency or have the same potency in the second lan- 
guage. For example, the word pet is a common word in 
E^S^sh. Its Spanish equivalent, animal domestico, is 
almost never used. Also, {translating a word from one 
language to another can vastly alter its meaning, like 
the wide variety of seemingly harmless English words 



ERLC 



2 

10 



/ 



which translate into Spanish swear words or "palabras ver- 
des". Thus, trans lating^a. large egg into a "huevon" may 
satisfy grammatical -requirements and seem harmless to 
a translator, but it has a more earthy connotation for 
Chicanes and Puerto Ricans. 

ETHNIC NORMS ^ 

The second major response of the testing industry 
to criticism has been to establish or propose re- 
establishment of regional and ethnic norms; in other words, 
to "compensate" minority children! for their "deprived 
backgrounds". Not only will such\a practice lead to 
lower expectations for minorities Vwhich, in turn, lowers 

children's aspirations to succeed), \ it is as shortsighted 

/ \ 
as awarding Chicane children extra points because "they 

\ , I 

speak a little Spanish." Ethnic norms take no account ! 

\ ' ^ ! 

of the '<;omplex reasons vhy mino?ity children on average! 

score lower than Anglos on IQ, and they are potentially 

dangerous because they provide a basis for invidious 

comparisons between different racial groups. The ten-' 

dency is to assume that lower scores are ultimately 

indicative of lower potential, thereby contributing to 

the self-fulfilling prophecy of lower expectations for 

minorities as well as reinforcing the genetic inferior- 



ity argument advanced by, Jensen (1971) and others. 

Furthermore, if test publishers and usees ^e 
willing to consider the establishment of ethnic norms, 
similar arguments could be made for the establishment 
of norms based on sex differences as well. Considering 
sex and ethnicity would require an almost infinite set 
of different inorm tables in order to account for all~of 
the different ^thnic subgroups in the United States. 
From the pract^ical point alone this leads to an absur- 
dity. Finallj^, the establishment of ethnic norms assumes 
that the groups are ethnically homogeneous witli little 
or no cross-over or intermarriage. One might wonder 
what publishers would propose to do with a set of male/ 
female twins who had a Mexican father and a Hungarian 
mother? 

CULTURE- FREE TESTS 

Another way in which the testing industry has 
responded to criticism of conventional IQ tests has been 
to create"culture^ree" tests. Such tests are difficul\ 
if not impossible, to construct. In tests of mental 
ability an attempt is made to determine the ability of 
a child to manipulate certain elements of a problem into 
a predetermined solution. But if a^l or some of the 
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elements are not equally familiar to the child, the 
test is unfairly biased. The influence of culture on 
conventional IQ test items is subtle in some cases, 
blatant in others • But the fact remains that in a large 
numxSer! of tradition *^,sts, the items are measuring 

something other th^*. -nat fo^ which they were designed. 
Items ' particularly influenced by cultural factors fall 

'/into the f611owing general classifications: 

\ 

Socialization^ Items of this type are pouched in 

\ ft- 
such a way as to actually be measures of the child's 

family value system. The referent system is, of course, 
the dominant Anglo-American middleclass. Th^ confounding 
effects of this problem are particularly evident in the 
"Comprehension" scale of the Weschler Intelligence Scale 
for Children (WISC) where children were asked such ques- 
tions as: ^ V 

"What^ is the thing to do if you lose one of 
your friend's balls? or "What is the ^thing 
to do if a fellow much smaller than yourself 
starts a fight?" 
Allowing for the stilted manner in which the ques- 
tions are phrased and assuming that ihe child knows all 
of the vocabulary, it still seem^ perfectly obvious that 
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this type of question \^ little or nothing to do with 
a child's ability to process, manipulate and/or code 
information; the answers depend almost exclusively on ' 
whether a child has been socialized under the particular 
ethical system implied by the question. 

Productivity or Level of Aspiration. Many tests 
confound what they 'hope "^o measure with a measure of 
productivity (i>r ' level of aspiration. For example, in 
a larg^ number of tests the child who produces the lar- 
gest number of responses is rewarded while the child who 
stops responding after only a few attempts is punished 
by receiving a lower score. Thus, in the Draw-A-Man 
test the child who produces the more elaborate figure 
stands the better cShahce of receiving the higher sdore, 
the assumption being that all subjects will produce as 
many responses as they are able--that they all have the 
same level of aspiration. ' 

Timed tests and "endurance" tests also fall prey 
to a confusion between the measurement of ability and 
the measurement of aspiration. In timed tests, which 
constitute the majority of publi/shed group tests, the 
children are asked to work quicicly, quietly and efficient 
^,ly wijthout regard for the child who is simply not in a 
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hurry and not particularly motivated to be so. The 
endurance test, for the purpose of boosting statistical 
reliability, requires that the child answer a large number 
of questions which vary lifdle in content. This problem 
is particularly evident in group tests such as the Lorge- 
Thorndike and the California Test Bureau series. Similar 

to the "endurance" test is the test where items are se- 

V 

quenced*^ in order of increasing difficulty. This design 
feature characterizes most of the standardized tests. 
In this situation the phild is forced — by design — to 
encounter increasing levels of failure and frustration. 
In the case of the child who starts out fearfully, as 
do most minority children, the first indication of failure 
or difficulty is .enough to discourage hijn or her from 
continuing and one finds the child "staring blankly off 
into space". 

Experience or Specific Learning . In tests which 
require answers of fact, there is an implicit assumption 
that all the children taking the test will have had an 
even chance, more or less, of having been exposed to the 
facts being tested. The spuriousness of this assumption 
is witnessed by any number of examples where children 
are asked questions of vocabulary. In such instances, / 
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it is imposoible to determine whether a minority child 
has missed a test item because he lacks the capacity to 
understand a given word or because he simply has never 
been exposed to the word. Nitroglycerin (in the WISC), 
fire hydrant (in the Bettj/ Caldwell Preschool Inventory 
and Peabody Picture Vocabulary Test) or crevice (in the 
Otis-Lennon) are terms unlikely to be heard or used in 
the home of the average" low SES or minority child* 

The WISC, perhaps the most widely-used individually- 
administered intelligence test in the world, is replete 
wi^h examples of the ojnportance of specific experience 
on test results. Take the WISC "General! Information" 
item, "In what kind of store do we buy ,6ugar?" If a 
child lives in an Indian reservation, he might buy his 
sugar in a drug store or at a trading post. A Chicano 
child might reply to that question, "at the Chinitos", 
a small varietj^ shop owned by a Chinese family. Yet 
these are not acceptable responses for credit in the 
Wise, Consider a^iother WISC item, "Where is Chile?" 
What if a child eats chile, as a Chicano child might. 
Or consider the item, "What is the thing to do if you ^, 
lose one of your friend's balls?" The acceptable WISC\^/ 
responses are "Give him one of mine. . ,try to get it ■ 
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back or replace it, . .or try to find it." A probable 
reaction of a Barrio or inner city child (depending on 
the size of the other child) would be avoidance, to es- 
cape the child's anger. That is not to say that these 
kinds of experience-based items are only found in the 
Wise scale. They are examples of these kinds of biases 
found in almost all tests of intelligence, 
THE VALIDITY AND UTILITY OF THE I^ SCORE 

The basic justification behind the use of the IQ 
score is that it statistically predicts^to mental retar- 
dation and low achievement. In fact, the IQ test is 
the sine quo non for screening children suspected of men- 
tal retardation* Mercer (1971) found that of those per- 
sons who would have been labelled as mentally retarded 
if their classification depended solely on test st!ores, 
a full 84% had completed 8 grades or more in school, 83% 
had held a job, 8 0% were financially independent or a 
housewife, and almost 100% were able to do their own 
shopping and to travel alone. In other words, it is 
probable that even at the task for which experts agree 
the IQ test is best suited — screening for mental retar- 
dation--the IQ measure has a dubious real life validity. 
The IQ test is also considered by many educators 
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and politicians to be a useful instrument for teachers-- 
for discovering unnoticed learning problems or intellec- 
tual strengths, for example. Indeed, many states mandate 
that districts administer IQ tests several times in a 
child's succession through the school system. But does 
the result really help the teacher? Let us take a typi- 
cal example. A teacher suspects a child of having a severe 
learning disability. She asks the school or district 
psychologist to test the child. The psychologist gives 
the ck^ld the WISC in which he scores, let us say, an 
IQ of bV. This psychologist happens to be extremely 
conscientious, so he devotes the. next few days to writing 

up an extensive report of his impressions of the child's 

i 

performance and potential. The child comes from a poor 
background, the psychologist writes. He has many isibs 
and is low in the hierarchy with regard to getting atten- 
tion. After much reviewing the child's school file and 
his own notes, the psychologist writes a report and hands 
it to the teacher. The teacher responds in surprise, 
"But I knew all that. What I want to know is what to 
do; how can I teach this child." Thus, in- most cases 
neither the psychologist nor the teacher are any wiser 
despite considerable tjjne and expense administering and 
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evaluating the IQ test. In this situation the psycholo- 
gist often finds him or herself in the middle of the 
delicate quasi political-asocial balance between the 

classroom teacher, the principal, the district adminis- 

\ 

tration, the child anA his family. 

While few psychologists would agree with the notion 
that educational decisions affecting the life of the 
child should be made exclusively on the basis of a single 
full scale IQ score, the fact nevertheless remains that 
these decisions are made by educators who, through per- 
sonal fiat supported by state mandate, ignore both the 
individual sub-scale profiles as well as psychologists* 
admonitions, simply for the sake of practical expediency. 
The results is, of course, a form of default institutional 
racism. ^ » 

INFORMATIONAL NEEDS WtTHIN THE x EDUCATIONAL SYSTEM AND THE 
12 TESTS 

Much of the controversy surrounding IQ tests and 
minority children focuses on whether the IQ model is a 
valid one. A more practical, and less abstract issue 
which needs to be answered concerns the general utility 
of the information produced by the test, i.\e., what can 
one do with the information provided by such a test? In 
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order to answer this question we must consider who is 
asking the question and for what purpose. ' It should be 
apparent that there are qualitative differences concerning 
the type of information needed within the educational • 
system. These differences depend on the source of the 
need. To a large extent, much of the confusion surround- 
ing the issue of whether to test stems from a failure 
to consider these differences. A consideration of the 
qualitative difference between these different needs 
will hopefully lend some clarity to the general contro- 
ver;§y as well as serve to introduce a procedure that 
attempts to meet some of the specific needs of each. 

There are three general levels of organization within 
the educational system that require information tradition- 
ally obtained through IQ testing. These three may be 
.described as (1) the funding level which involves edu- 
cational agencies supra-ordinate to the local school 
districts; (2) th.e local level which consists of both' 
district personnel and building-principals and finally; 

(3) the plassroom teacher, paraprof essional and parent. 

\ 

Whether or not the IQ score information can serve any 
ultimate useful function will depend on the particular 
and peciuliar needs of each of these three groups. It j 

\ 
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is unfortunate that historically all three attempt to 
use the IQ score for radically different and often 
conflicting purposes. 

Consider first the supra-ordinate funding agencies 
such as state and federal departments of education. For 
purposes of determining the allocation of funds, these 
agencies 3?equire information concerning educational and 
program needs at the school district level. While one 
source of information they generally use are IQ measures, 
one wonders whether the information produced by IQ tests 
adds anything of value to needs assessment. More appro- 
priate, "needs assessment" procedures, it would seem, 
would be restricted to the assessment of whether specific 
educational programs in areas of reading, arithmetic or the 
like are needed rather than attempting to infer specific 
need from omnibus assessments based on so poorly under- 
stood a concept as IQ. # 

The supra-ordinate agencies have a second related 
informational need. In contrast to needs assessments 
which very often can be conducted through examinations 
of school attendance records, age-grade placement patterns 
and achievement data in the broad sense, funding agencies 
need to know about the effectiveness of particular programs 



■Ml 



1/ 



Due to "accountability" and "evaluation/audit" 
requirements, the agencies chiefly responsible for the 
allocation of funds have mandated that testing be con- 
ducted at the child level as a means of collecting in- 
formation which can be used to evaluate educational pro- 
grams. In actuality, program evaluations can be made through 
p. variety of means or procedures, none of which necessar- 
ily has anything to do with IQ or summary-score types 
of measures discussed above. For example, a reasonable 
assessment of program effectiveness can be made through 
the collection of nominal data at vkrious levels of school 
and community organizations. Thus, administrators, teachers, 
parents and children can be interviewed as to their per- 
ceptions of the effectiveness of the different processes 
of the program. On the other hand, more product -or iented 
assessments, at the child level, can be made according 
to the specific educat'ionar^obj ective of the program. 
The reporting of these latter data can be made on the 
basis of group change scores, without reference to indi- 
vidual scores, eliminating the potential dangers inherent 
in individual scores. Finally, all of the various data 
can be integrated to produce a report which describes 
the program and weights the relative importance of each 



aspect of the prograiA in reconunending changes in various 
par1:s of the total operation. 

It is interesting to note that there is a paradoxical 
element in using IQ scores to evaluate program-related 
change. The paradox stems from the fact that the IQ model 
is based on the notion that intelligence is static and 
hence not subject to change. What this means is that 
educators have been using IQ scores to evaluate change 
which can not occur according to the IQ model. The IQ 
model, which negates time or age through the division 
of mental age by chronological age, is by definition a 
static model and, therefore, inappropriate for measuring 
program-related change which must take place over time. 

At the local level, school district personnel need 
information as to the needs of children and the effective- 
ness of individual programs in the same way as do- the 
supra -ordinate agencies. However, needs assessments are 
usually conducted at the state level and school officers 
more often than not use the state-provided information 
as a statement of need, preferring not to conduct detailed 
expensive in-house initiated research. 

Ideally, the evaluation of individual programs should 
center around the collection of data which follows directly 
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from the particular program objectives and activities. 
However, it is often the case that instruments of evalu- 
ation have very little to do with actual programs. Rather 
than program-specific measures, very often IQ or nationally 
normed achievement tests are used. Since IQ and other 
such scores provide precious little in the way of infor- 
mation about the effectiveness of individual programs 
and program components, there results great confusion 
in interpreting program evaluations. For example, in 
comparison with an appropriate "control group" suppose 
that children who participated in a special program on 
arithmetic computation had received higher scores on a group 
IQ score than they had the year before. With this type 
of evaluation officials would be hard put to say exactly 
which aspects of the program had led to the improved IQ 
scores since the connection between the two remains either 
ill-defined or nonexistent. However, in contrast, a test 
(administered before and after a specific program) which 
sampled from each of the different aspects of the program 
would provide specific information o^ which 'aspects of 
the program led to improved scores Jnfl which did not. 
In other words using IQ scores to evaluate programs not 
specifically designed for the purpose of elevating IQ 
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scores is a little like using a ther^mometer to record 
height changes in a given period of time. 

Unfortunately, the last to be consiv^ered in the edu- 
cational hierarchy is the classroom teacher and what she 
needs in order to assist the learner. What can the teacher 
do with an IQ score? The limited utility of the IQ or 
for that matter any other summary score whether it be 
reading level according to grade equivalent score, grade 
point average, percentile rank or the like has already 
been discussed. Perhaps the fact that to a large extent 
the tests do not provide the teacher with much usable 
information is historically related to the fact that 
psychological tests were initially developed for purposes ■ 
not functionally related to the present daily needs of 
the classroom teacher. It is also of some interest to note 
that even to the present day it is not the classroom teachers 
who either design the formal^ of the information produced 
by the testing or, select the speciyic test or tests which 
are to be used. 

" There is still another way in which the use of IQ 
scores in the cl<assroom might be questioned. Given it 
is possible to avoid the problems associated with adopting 
tests which employ the IQ model there is a final question 
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at the teacher-level concerning the ultimate functional 
use of the IQ score. If it is given that the IQ score 
is an accurate index of the children in a classroom, 
then what? What a teacher is asked to do is to discrim- 
inate between scores in designing curriculum for each 
and evet^jL^hild on the basis of inf ormation^hich she 
can not discriminate. What would a teacher do differently 
for a child with an IQ of 92 that she would or would not 
do for a child with an IQ of 100? One might agree that 
these two scores are functionally equivalent because 
they are both within the normal range (i.e., within one 
standard deviation of the mean). However, what about 
the case of a child who has an IQ score of 84 which is 
approximately^ a standard deviation below the mean and 
in some states considered as possibly falling into *'EMR", 
! "retarded" or "slow learner" categories? In the one 
case where the comparison is made between 92 and 100 
there would be no difference in educational recommendation, 
however, in the other case where scores of 92 and 84 are 
compared, very different educational recommendations would 
be made. In both cases there is the same absolute dif- 
ference in scores. 

The point is that we have been basing educational 
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decisiono on IQ tests which often produce questionable 
results. In many cases, the same criticisms apply tc 
achievement • tests which" provide collapsef^ scores. Func- 
tionally, what programmatic difference can the teacher infer 
from scores provided in grade equivalent terms. What 
are the different educational decision'^ to be drawn from 
reading grade equivalency scpres of 3.»2 versus 3,6 and 
3.6 versus 4.,0? In the case of both the IQ score and 
the collapsed achievemenv scorfi, the teacher, who is^ 
ultimately '^the person held "accountable", is simply not 
being provided with enough ihformation upon which to base 
sound daily educational decisions. The teacher is thus 
by default forced to make arbitrary decisions which in 
most cases can xiot be defended. The fact that the tests 
have "predict ive- validity" in that they forecast failure 
on the part of Spanish-speaking or other minorities and 
success for the majority only speaks to the inappropriate- 
ness of the curriculum and to the failure of the American 
educational system to provider education for many of its 
children . 

PAPI; A PIAGETIAN-BASED ALTERNATIVE 

In the following, a computer-based jr,odel will be 
described which was designed with many of the above con- 



ERIC 



19 



27 



\ 



i 



cerns in mind. This model has been successfully tested 
in a variety of ways in three Southwestern and two Western 
states. The discussion" provided below will outline the 
elements of the model. More substantive discussion con- 
cerning the psychometric features of the measures the 
model employs are subjects of several oth^r articles 
currently in the journals or b^ing prepared for publication. 

The program Assessment Pupil Instruction (PAPI) system 
uses a number of . Piagetian-based measures to generate 
two basic types of information by means of a centralized 
computer data-processing prog-ram. 

The first type of information provided by the PAPI 
system is statistical in nature and is meant for program 
evaluation at the funding, administrative and district 
levels. The second type of information is orien/ed towards 
the needs of the teacher and consists of recommended activi- 
ties specifically designed for each child in the class- 
room. 

In its present form, PAPI uses four Piagetian\- based 
measures developed jointly and individually by Pascual- 
Leone and DeAvila. Two types of measures are used. The 
first is used as a base measure of achievement where in- 
formation is collected which follows from classroom 
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activity. The second type 5f measure attempts to provide 

an index of the child's current level of development and 

involves three different tests. 

In the following section, each of the four tests in 

the battery is described. The first to be described is 

the measure employed as the basic achievement index. The 

following three are used to provide an index of ievel 

of development. The description of the test^ is followed 

by a more detailed description of the PAPI system. 

1. CARTOON CONSERVATION SCALES (CCS) 

Several measures of Piaget's conservation tasks 
are assessed by means of a cartoon format developed 
by De Avila, Struthors and Randall (1969). In 
De Avila 's procedure, three horizontal cartoons 
are presented in which two children are discussing' 
the task. In the first frame an equality is^ 
established between two objects according to the 
dimension being tapped (i.e., number, length, sub- 
stance, etc.). In the second frame an identity 
transformation is depicted and in the third frame 
the question of equality is asked. On the right 
side of the panel tliree possible answers are presented 
in a vertipal order.. The three alternatives which 
show the characters responding to the question are ^ 
randomly ordered as to correctness in order to avoid 
position effecrts. Similarly, within, a scale wording 
is altered from panel to panel in order to avoid the 
possi]:)le effects of acquiescence. Background on 
the conservation scales and an illustration of the • 
dialogie from each scale are presented below. 

Struthers and De Avila (1967) and De Avila et al 
• (1969) and others have validated the CCS procedure 
in a variety of ways. In one study De Avila, et 
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al (1969) tested thirty male and female first 
grade subjects using both^group and individual clin- 
ical methods. Statistically significant correlations 
were obtained across the two methods -of assessment 
(r = 0.663), The scales also possessed a high 
degree of internal consistency as shown by an 
examination of factor analytic structure and 
reliability indices. 

In its current f orm 't-he CCS consists of thirty - 
cartoon panels; six examples of five tasks. The 
panels are presented to the subjects and the story 
line is read and elaborated upon in order to facilitate 
understanding of the question. The subjects task 
is to mark the one (alternative) "that makes the 
story true." ^ 

2. WATER LEVEL TASK (WLT) 

The conservation of the horizontality of water 
described by Piaget and Inhelder (1948), and used 
by Smedslund (1963), Dodwell (1963), Rebelsky (1964), 
Beilin, Kagan and Rabinowitz (1966) and Pascual- 
Leone (in press). This task involves the subject 
being able to break perceptual set to recognize 
that no matter what angle a bottle placed gj^ a, f.lat 
horizontal surface is viewed from, the water level 
will always be parallel to the horizontal surface. 
A more complete description of the relative param- 
eters of this type of task can be found in the 
semantic-pragmatic analysis of Pascual-Leone (in 
press) . 

A special version of Pascual-Leone * s ^'group tests 
developed by Pascual-Leone and De Avil^ is used. 
Subjects are presented with individual booklets 
which contain five horizontal or vertical tvo- ^ 
dimensional bottles, eight two-dimensional -tilted 
bottles and four three-dimensional bottles, two of 
which are also tilted. The subject is asked to 
draw a line where the top of the water would be if 
the bottle were half full and then to place an "X^' 
in the part that contained the water. 
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3, FIGURAL INTERSECTIONS TEST (FIT) 

The figural intersection test is a group administered 
paper-and-pencil test in which subjects are required 
to place a dot in the intersection space of a varying 
number of geometrical figures. It was developed 
by Pascual-Leone and constitutes a figural analogue 
to Piaget's work on intersection of classes • In 
a series of unpublished studies, Pascual --Leone has 
shown the test to have a high degree of internal 
consistency (split-half reliability = •89) as well 
as being significantly related to tests of similar 
logical structure (Pascual-Leone and Smith, 1968). 
For example, it has- shown a high correlation with 
the WLT described above • Combined with the WLT, and 
ST described below, in the present context, it is 
taken as an index of developmental level. This^ 
relationship has been previously found in a series 
of unpublished studies by Pascual-Leone and Parkin- 
son at. York University, and De Avila (1971). 

SERIAL TASK (ST) 

The serial task (De Avila, 1971) is a short term 
memory task which is,. individually administered in 
two phases. First, subjects are pre-exposed to the 
stimulus materials used in a second testing phase. 
In t^ie pre-exposure or pre-training phase, each subject 
is shown a series of 10 different 3 5mm color slide 
transparencies of pictures depictings a donkey, house, 
airplane, etc. Subjects sit facing a screen situated 
on a wall six feet away. The 10 illustrations are 
presented by means of a Kodak 550 carousel slide ^ 
projector. To introduce the task, each subject is 
shown each figure and asked to give its name and 
color (i.e., "a yellow hat"). Following this ini- 
tial introductory phase and after the subject is 
able to correctly identify each figure ten times 
when presented in rapid random succession, the test- 
ing phase takes place. 

The test phase is conducted in a "free recall^' manner 
where, without any prior kno|i/ledge of the length of 
a list, the subject is asked to reproduce the list 
ignoring the order in which the individual items 
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are presented. Subjects are shown a series of 
individually presented figures terminated by a 
blank,, slide, and asked to tell the experimenter 
what they have seen. The exposure time for each 
individual slide is .750 msec. There is no require- 
ment that the sequence of the presentation be main- 
tained, or that the subject respond within a speci- 
fied period of time, or produce a predetermined niun- 
ber of , responses . The* child is simply asked to 
reproduce what he has seen using whatever lab^'^s 
are convenient. , 

There are seven sets of figures presented to each 
subject. These seven sets vary as to the number 
of stimuli witMn a series. The number of figures 
presented within a series, as well as the individual 
figures, are randomly varied. Finally, each illus- 
tration is presented no more than once within a 
series. 

The PAPI system uses these four measures to produce 
several types of information each of which has a partic- 
ular function, .^pirst, psychometric information on each 
test is produceci as a means of assuring that the tests 
^used in any giv;en cas6 have satisfactory levels of 
reliability and validity. This information is of primary 
importance to the researcher or program evaluator as a 
check against various types of problems associated with 
test administration, sampling procedures and so on. The 
second type of information produced by the PAPI system 
can be described as developmental-normative. These data 
provide normative information regarding the current 
developmental levels of the entire sample. Moreover, 
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these data can also be stored to serve as base-line 
data which can be compared across temporal (as in a pre- 
post design) or situational/program dimensions. In sum- 
.mary, these two types of information, the psychometric 
and the normative /developmental data, are geared, to the 
•needs of program evaluation and needs assessment at the 
administrative levels. 

The third type of information generated by the PAPI 
system consists of lists of recommended classroom activ- 
ities for each child tested* These activities are geared 
to providing children with experience in specific educa- 
tional activities thought to be important in the overall 
educational /program objectives. 

In generating the suggested classroom activities 
for each child, four factors are taken into account: 

y) The child *s achievement level with respect to 
the concepts measured by the CCS. 

2) The child *s developmental level as measured by 
the WLT, FIT and ST tests. 

3) The achievement level of the child's referent 
group with respect to the concepts measured 
by the CCS. 

H) The developmental level of the child's referent 



group as measured by the WLT, FIT and ST tests. 
An example of one computer printout which lists speci- 
fic classroom activities is provided below. Using the 
four factors listed above, the following example would 
indicate that : ' . 7 

1. The child's overall developmental test profile 
is similar to children of the referent group 
who have already acquired the concept. 

2. The child does not understand the concept. , 

It is then inferred that a specific set of activities 

are developmentally/educat ionally appropriate with respect 

to the concept of number as tested by the CCS. For example, 

this conjunction of conditions would lead to the following 

set of recommended classroom activities: 

CONCEPT OF NUMBER 

THIS CHILD DOES NOT HAVE A CONCEPT OF NUMBER, 
HOWEVER, HE IS READY TO LEARN. TRAIN HIM BY 
GOING THROUGH THIS SEQUENCE, USING CONCRETE 
ACTUAL OBJECTS ONLY. 

A. SORTING AND CLASSIFYING OBJECTS BY 

1. COLOR 

2 . TEXTURE 

3. SIZE 

H . WEIGHT 

5 . NUMBER 

6. WHATEVER OTHER ATTRIBUTES YOU 
CAN THINK OF 
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B. COUNTING ACTIVITIES, SUCH THAT THE CHILD 
LEARNS... 

SEQUENCE, NUMBERS INCREASE BY 
"^ME EACH TIME 

2 . INVAR lANCE 

A. PATTERN, 5 OBJECTS BUNCHED AND 
5 OBJpCTS IN. A ROW ARE BOTH 5 

B. OBJECTS, S ELEPHANTS AND 5 FLIES 
ARE BOTH 5 

3. SYMBOLS, LEARN THE ARBITRARY NAMES 
OF THE NUMERALS 

C. CONCEPT AND LANGUAGE OF EQUALITIES /JJD 
INEQUALITIES 

1. > . WHICH HAS MORE, IS BIGGER, 
IS LONGER, ETC. 

2. < . WHICH HAS LESS, IS SMALLER, 
IS SHORTER, ETC. 

' 3. = . WHICH HAS JUST AS MUCH, IS 

JUST AS BIG, IS JUST AS LONG, ETC. 

The above is an example of what is sent directly 
to the teacher. It can also be sent to the home so that 
the parents are made fully aware" of what the teacher is 
trying to accomplish with the child. In this way, 
posS'ibly with some guidance from the teacher, the parent 
can participate in the child's education while support- 
ing the teacher. 

, It should be noted that the PAPI system is designed 
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so that a child's peer or referent group can be de- 
signed according to any number of nominal descriptors such 
as grade, sex or ethnic group. Thus far, only chrono- 
logical age has been used because of the importance of 
the age variable in Piaget's theory. It should also 

be noted that, in the present description of PAPI/ system, 

/ 

the CCS IS treated as an achievement test rather/ than 
as a test of development as ^it was originally designed. 
Actually J any test of achievement' can be used Within 
the system as long as it is linguistically and culturally 
appropriate and is constructed in such a way that it can 
be used to generate suggested activities. ^ 

In summary, the PAPI system can be use^ with any 
child to produce educational program data, or data for 
program evaluatiot. The purpose of the present discussion 
has not been to suggest that the system is in any way 
a complete educational package. Further development 
is needed in creating test procedures and in training 
educators, on how to use this type of system. But given 
the problems associated with testing, the PAPI system 
suggests one approach to the. issue /which speaks to the 
different needs of people within the educational com- 
munity. 
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- SUMMARY 

The rr.aj or. thesis has peen that test publishers and 
the users of standardized llQ and summary- score achievement 
tests have failed to consider the problems associated 
with testing the minority child ♦ And since the results 
of these tests are used to determine the„ educational and, 
b^ extension, the economic and social future of school- 
age children, it behooves test publishers and the edu- 
cational community to take a harder look at the minority 
child's cultural background. It is the authors* opinion 
that consideration of these issues leads to thp con- 
clusion that the problem of testing cannot be solved by 
attem^pts to recreate standardized tests for minority 
children which ane based on old conceptions of intelli- 
gence and educational achievement. It is concluded that 
what is required is a radical change in the whole approach 
to testing and the generation of entirely different models 
of education and of testing. There is growing support 
for this conclusion (for example see McClelland, 1973)* 
The PAPI system is just one step in an attempt to move 
in a new direction. 
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